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Entanglement can effectively increase communication channel capacity as evidenced by dense 
coding that predicts a capacity gain of 1 bit when compared to entanglement-free protocols. However, 
dense coding relies on Bell states and when implemented nsing photons the capacity gain is bounded 
by 0.585 bits dne to one’s inability to discriminate between the four optically encoded Bell states. 

In this paper we study the following question: Are there alternative entanglement-assisted protocols 
that rely only on linear optics, coincidence photon counting and separable single photon input states 
and at the same time provide a greater capacity gain than 0.585 bits. We show that besides the Bell 
states there is a class of bipartite four-mode two-photon entangled states that facilitate an increase 
in channel capacity. We also discuss how the proposed scheme can be generalized to the case of 
two-photon N-mode entangled states for N = 6,8. 

PACS numbers: 03.67.Hk, 03.65.Ud 


I. INTRODUCTION 

Dense coding is a quantum communication technique 
that allows one to send two bits of classical information 
per single qubit transmitted over a quantum channel. 
This becomes possible when a sender and a receiver pre¬ 
share a maximally entangled two-qubit state (Bell state). 
Of course, if no entanglement is shared, at most one bit of 
classical information can be communicated by sending a 
single qubit. Therefore, using entanglement as a commu¬ 
nication resource increases channel capacity by 1 bit. The 
original protocol [1] - closely followed in all optical qubit 
implementations [2-5] - was designed for generic qubits 
and has two main requirements. First, the sender must 
be able to generate all four Bell states from any given Bell 
state via a local (single-qubit) operations only. Second, 
the receiver must be able to perform an unambiguous Bell 
state discrimination (typically using a two-qubit CNOT 
gate). For optical qubits - most suitable for a long dis¬ 
tance communication - the first requirement can be read¬ 
ily fulfilled by combining spontaneous parametric down 
conversion entangled photon sources with linear optical 
devices such as beam splitters, polarization rotators and 
phase shifters. However, the second requirement of the 
original protocol is very stringent. Unfortunately, one 
cannot deterministically distinguish all four Bell states 
only by means of linear optical devices and coincidence 
measurements [6]. Either hyper entanglement [3-5, 7] 
or additional entangled ancillae are needed [8, 9] making 
all-optical implementations challenging. As a result, the 
entanglement-assisted channel capacity gain is bounded 
by 0.585 < 1 bits. But even achieving the reduced bound 
is fairly difficult due to experimental imperfections. For 
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instance, the actual channel capacity gain reported in [2] 
was ss 0.13 bits. 

This prompted us to ask the following question: Are 
there alternative entanglement-assisted protocols (not 
based on Bell states) that utilize only linear optics, coin¬ 
cidence photon counting and input single photon prod¬ 
uct states and, at the same time, provide a greater ca¬ 
pacity gain than 0.585 bits? We answer this question 
by constructing a communication protocol that satisfies 
the above resource constraints. In particular, we find 
an equivalence class of four-mode two-photon entangled 
states that can be prepared using product single photon 
input states with linear optical elements and transformed 
into each other by means of linear optical operations on 
(two) “local” modes of one of the parties and, at the same 
time, can be discriminated by a photon coincidence mea¬ 
surement deterministically. We formalize the problem 
mathematically in Section II and show that the explicit 
structure of these states as well as the detailed experi¬ 
mental setup that implements our protocol can be ob¬ 
tained by maximizing the mutual information between 
the sender and the receiver over all physical input states, 
local/global operations and detection schemes. By solv¬ 
ing the optimization task numerically in Section III, we 
find that the communication channel capacity of our pro¬ 
tocol with ideal detectors is 2 bits. 

To verify that entanglement does indeed provide a gain 
in the channel capacity we determine the upper bound on 
the channel capacity with respect to all possible resource- 
equivalent entanglement-free protocols. We show in Sec¬ 
tion III that, under the condition of no vacuum detection, 
no entanglement-free protocol can achieve the channel ca¬ 
pacity greater than 1 bit and, thus, our protocol allows 
the sender to communicate an extra 1 bit of classical in¬ 
formation which is better than the gain of 0.585 bits of¬ 
fered by dense coding with linear optics. 

On the other hand, we demonstrate that if vacuum de¬ 
tection is allowed but detectors are imperfect then our 
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protocol still provides a detection-efliciency-dependent 
channel capacity gain. For example, the gain is « 
0.27 bits for the state-of-the-art superconducting single 
photon detectors. Also, our protocol can be extended 
to the case of two photons shared among iV > 4 modes 
and in the Section IV we consider two scenarios with 
N = 6 and N = 8. We show that in both cases solu¬ 
tions exist that provide a capacity gain over correspond¬ 
ing entanglement-free protocols. 


II. CHANNEL CAPACITY FORMALISM FOR 
FOUR-MODE TWO-PHOTON 
COMMUNICATION PROTOCOLS 

When implementing an abstract two-qubit system us¬ 
ing single photons two so-called “dual rail” schemes 
are prevalent. The first one is the polarization encod¬ 
ing where the logical zero and one states of each qubit 
are realized as the horizontally \H) and vertically \V) 
polarized single photon in a given spatial mode and 
the second is the spatial mode encoding where a sin¬ 
gle photon placed in either one of two spatial modes 
i.e. |0,1) and |1,0) represents the logical zero and one 
states. The schemes can be mapped onto each other 
by setting \H) = |0,1),|V) = |1,0). An arbitrary lo¬ 
cal (involving modes of one of the parties only) oper¬ 
ation can be performed by means of linear optical ele¬ 
ments such as a polarization rotator or a beam splitter 
in the case of dual rail encoding. This class of opera¬ 
tions will map the two-qubit computational space = 
span{|0,1,0,1), |0,1,1,0), |1, 0,0,1), |1, 0,1,0)} onto it¬ 
self. However, when implementing an operation involv¬ 
ing spatial modes from both parties with linear op¬ 
tics an input state from may end up in a larger 
Hilbert space For example, consider the action of 

a 50/50 beam splitter between modes 2 and 3 on the 
state \ip) = |0,1) 0 |1,0) € C^. The resulting state 
\'4>) = ;^(|0, 0,2,0) — |0, 2, 0,0)) actually lies outside of 
in a larger Hilbert space In fact, this becomes 
an issue when one tries to implement a quantum com¬ 
puter using only linear optical transformations. But for 
quantum communication problems it may be more ad¬ 
vantageous to operate in the full (two photons in four 
modes) Hilbert space 

Indeed, consider the following communication proto¬ 
col implemented using N = A modes and n = 2 photons 
(see Fig. 1). The source generates a special initial two- 
photon four-mode state I'l'in) G and sends half of the 
modes to Alice and the other half to Bob. For the sake 
of concreteness we assume that Alice gets modes labeled 
1 and 2 (1 through JVa if JV modes are shared). As a 
result, Alice and Bob share the state |'kin)- Upon re¬ 
ceiving her part of the state, Alice transforms her modes 
using one of the four (or potentially more) predetermined 
two-mode unitary transformations Ui,i = I,-- - ,4. She 
chooses which unitary operation Ui to apply according to 
a probability distribution p{Ui) and does not disclose her 
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FIG. 1: (Color on-line) Illustration of the proposed 
entanglement-assisted communication protocol. Alice and 
Bob share a two-photon four-mode state distributed 

by the source. Alice performs a “local” mode transformation 
Ui on her two modes and sends them to Bob who performs a 
coincidence detection on all four modes and estimates index 
i. 


choice to Bob. Next, Alice sends her part of the state to 
Bob who now has one of four possible two-photon four¬ 
mode states \tpi) with probability p{\tpi)) = p{Ui)- Bob 
wants to learn which unitary transformation Ui was per¬ 
formed by Alice i.e. which state |'0i) he has at hand. 
To do that he sends all four modes through a detection 
setup that performs a four-mode unitary transformation 
Usob and measures the projection of the output state 
onto the two-photon four-mode Fock basis states {|(/i) = 

12000), |</.2) = IllOO),-- - ,|(/9) = 10011), l^io) = 10002)}. 

Using the outcome of the measurement Bob tries to guess 
the index i of the state that Alice has sent. 

In qubit-based communication protocols Alice receives 
a single qubit (i.e. photon) from the source. The pho¬ 
ton, depending on the encoding scheme, is distributed 
over two (polarization or spatial) modes. The local uni¬ 
tary operations Ui,i = 1, • • • ,4 that Alice performs on 
her qubit do not change the total number of photons she 
communicates to Bob. This provides an implicit con¬ 
straint on the average number of photons used to com¬ 
municate a single character i.e. 

4 

+ O.A2aA2\'4’i) = 1 , ( 1 ) 

i=l 

where oai and aA 2 are photon annihilation operators for 
Alice’s mode 1(2) and pi is the probability that Alice 
performs an operation Ui. In our qubit-less protocol the 
source can potentially provide a state |4>i„) that violates 
the constraint in Eq.(l). To make our protocol compara¬ 
ble to dense coding with linear optics in terms of photonic 
resources needed to communicate a single character we, 
therefore, explicitly require that the state provided by 
the source to Alice and Bob is such that it satishes the 
photon number constraint in Eq.(l). 

Naturally, the best case scenario in terms of informa¬ 
tion transmission(assuming a noiseless quantum chan¬ 
nel and perfect detectors) for our protocol is when Al¬ 
ice can prepare four orthogonal states |V'i) subjected to 
constraints Eq.(l) and Bob can unambiguously detect 
which state \^pi) Alice has sent him just by means of 
photon coincidence measurements. In this case, analo- 
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gously to the original dense coding proposal [1], Alice 
and Bob can communicate two bits of information (pro¬ 
vided Pi = jVi) by sending on average one photon in 
just two modes instead of four. However, whether such 
states lipi) (equivalently a state I'l'in) and two-mode uni- 
taries Ui) exist is an open question. Also, to conclude 
that our protocol is indeed of a dense coding type, we 
need to determine the largest amount of classical infor¬ 
mation that Alice and Bob can share by utilizing at most 
two single photons and two modes under the constraint 
Eq.(l). Only if the latter is less than 2 bits our proto¬ 
col demonstrates an information gain. To answer these 
questions we need to formulate the problem in terms of 
communication channel capacity determination. 

From the information-theoretical perspective Alice and 
Bob form a two-mode two-photon communication sys¬ 
tem with two auxiliary modes in which Alice encodes 
each letter of her message using a two-bit alphabet X = 
ll'i/'i), • • • , |t^ 4 )} with probability p{ipi) = p{Ui). The re¬ 
ceiver, Bob, detects the message sent by Alice as a collec¬ 
tion of random signals from the set y = • • • , |<('io)} 

with the conditional probability p{y\X) = p{'ilji\<pj) = 
Kcpiltpj)]^,i = I,-- - ,10;j = I,-- - ,4 and applies a de¬ 
coding rule to estimate the original message. Note that 
p(y\X) is a function of the initial state Alice’s 

unitary operations Ui,i = I,-- - ,4 and Bob’s detection 
setup UBob- In this context, finding the highest rate (in 
bits) at which information can be sent from Alice to Bob 
is equivalent to determining information channel capac¬ 
ity of the Alice-Bob system. The information channel 
capacity C of Alice-Bob channel is defined as [10], 

C ='niaxl{'>p; (j)), (2) 

where the maximization is performed over all possible in¬ 
put distributions p(^/’i), states unitary mode trans¬ 

formations Ui, Usob and is subjected to the constraint in 
Eq.(l); I is the mutual information, 

EE ■ (3) 

PWj) 

Here pi^’j^'t’k) = p{'^j\4>k)p{4>k) denotes the joint proba¬ 
bility of Alice preparing the state jf/'j) and Bob detecting 
the state \4‘k)- The marginal probability p^/pk) is defined 
asp((/)fc) = E^^.p(V’4,</>fc)- 

By definition, the mutual information /(f/'; (j)) is a con¬ 
cave function of p(V’i) (3 independent real-valued pa¬ 
rameters) and a non-concave function of the unitary 
mode transformation matrices Ui G U(2),i = I,-- - ,4 
(16 real-valued parameters), and Usob G S'C/(4) (15 in¬ 
dependent real-valued parameters) and the input state 
that we parametrize using ten complex parame¬ 
ters Ck (18 independent real-valued parameters [12]) as 
10 

l^in) = X) Cfc|(/)fc). We can always set one of the matrices 

k=l 

Ui to be the identity 11 matrix which will leave us with 
only three independent 2x2 unitary matrices. Therefore, 



Dense 

Proposed 


Coding 

Protocol 

Entanglement 

Bell States 

Multi-mode Entanglement 

^ of photons 

1 

1 

sent by Alice 


on average 

Total ^ of photons 

2 

2 

Total # of modes 

4 

4 

Operations 

Linear Optics 

Linear Optics 

Detection 

Coincidence 

Coincidence 


TABLE I: Resource overview for the proposed entanglement- 
assisted protocol and conventional dense coding with photonic 
qubits 


the total number of real-valued optimization parameters 
in Eq.(2) is 48. 

Since we set Alice’s alphabet to only four letters, the 
global maximum of the channel capacity C over all pos¬ 
sible physical setup parameters cannot in principle ex¬ 
ceed log2(4) = 2 bits. This follows form the definition 
of the mutual information in Eq.(3). We observe that 
for any two random variables X and y : I{X-,y) = 
H{X) — H{X\y), where H denotes Shannon entropy [10]. 
Since H > 0, the maximum of I{X;y) is achieved when 
H{X) is maximal (= log 2 (|T|)) and H{X\y) is minimal 
(=0) i.e. maxJ(A;A’) = log 2 (|T|). However, it is not 
clear if this bonnd is physically attainable. Also, due 
to the non-concave nature of the optimization objective 
function many local maximums may exist. Of course, 
when optimizing /(^; (/>) numerically we are interested in 
finding a supremum of all local maximums and hope that 
it is 2 bits. Note that because I is concave in parameters 
piipi), if the global (2 bits) maximum is attained, using 
the preceding argument one can immediately show that 
the only possible values of p{ipi) = jVi. Therefore, we 
can further reduce the number of real optimization pa¬ 
rameters to 45 by setting p{tpi) = We conclude this 
section by providing an overview of the resource require¬ 
ments for our entanglement-assisted communication pro¬ 
tocol as well as for the photon-based dense coding proto¬ 
col in Table I. The only difference between the protocols 
comes from the number of photons that Alice communi¬ 
cates to Bob. In our protocol this number is one on aver¬ 
age. Whereas in the dense coding protocol Alice always 
sends Bob one photon. This requirement also restricts a 
class of resource-equivalent entanglement-free protocols 
to those that use one photon on average. 

III. FOUR-MODE TWO-PHOTON PROTOCOL 
ANALYSIS 

A. Optimization Results 

First, to test our approach, we solved the optimiza¬ 
tion problem in Eq.(2) using a fixed state |'I'i„) provided 
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by the source. We set !'!'„) to be equal to one of the 
Bell states (it does not matter which Bell state is se¬ 
lected, optimization works equally well for all of them) 
and found by numerical optimization that in this case 
C = log 2 3. Moreover, Alice’s mode transformation ma¬ 
trices that correspond to this solution are the same as the 
ones originally proposed by Bennett and Wiesner [1]. It 
means that by setting the initial state to a Bell state the 
conventional Bell state-based dense coding protocol [1, 2] 
is recovered. 

Next, we have discovered, by using gradient-based op¬ 
timization methods, that the global maximum (C = 
2 bits) is indeed achievable in The structure of 

globally optimal solutions encountered in our numerical 
search can be parametrized as follows. All globally opti¬ 
mal input states I'l'in) prepared by the source are, up to 
a swap of any two modes, equivalent to the state, 

| 4 -„) = ^(| 1 , 1 , 0 , 0 ) + 10 , 1 , 1 , 0 )+ 11 , 0 , 0 , 1 ) 

+ 10,0,1,1)). (4) 

For example, the following input state 

!§) = ^(|i,o,i,o) + |o,i,i,o) + |i,o,o,i) 

+ 10,1,0,1)), (5) 

obtained from I'l'in) by swapping modes 2 and 3 also 
leads to the globally optimal solution with C = 2 bits. 

Moreover, l^^in) in Eq.(4) also defines a class of globally 
optimal input states that are equivalent to |'I'j„) up to a 
four-mode unitary transformation: 


Ut = 


Ua 0 

0 Ub 


( 6 ) 


where Ua,b are arbitrary unitary matrices S U{ 2 ), 


Ua = 

cos 9i 

sindi 

gi</3 gjji 

gl(02-|-<l3-dl) QOS 01 

Ub = 

gi04 gQg ^2 

—sin 02 

g*<l 6 gjj^ ^2 

QOS 02 


(7) 

( 8 ) 


and parameters 6 * 1 ^ 2 , ,6 are arbitrary angles G [ 0 , 27r]. 

Given matrices Ua and Ub, Alice’s globally optimal 
mode transformation matrices (acting on modes 1 and 2 ) 
can be decomposed as Ui = U^^Uc, U 2 = —U^^azUc, 
U 3 = —C/ 2 , C /4 = C /2 • C/ 3 , where Ua is defined in Eq.(7), 
Uc is an arbitrary 2 x 2 unitary matrix G U ( 2 ) with a 
similar parametrization and cr^ denoted Pauli sigma Z 
matrix. 

Lastly, Bob’s four-mode transformation matrix UBob 
can be represented as follows, 




c/; 


0 


0 ui 


10-10 
010-1 
101 0 
010 1 


(9) 



FIG. 2: (Color on-line) Linear optical implementation of 
the proposed entanglement-assisted protocol. Blue rectan¬ 
gles represent 50/50 beam splitters and transparent squares 
represent 180° phase shifters 


Implementing this mode transformation matrix in an ex¬ 
periment, Bob will detect four distinct coincidence pat¬ 
terns: a coincidence between detectors in modes 1 and 
2 correspond to Alice’s choice C/i, modes 2 and 3 cor¬ 
respond to C/ 2 , modes 3 and 4 correspond to C/ 3 , modes 
1 and 4 correspond to C/ 4 . We remark that this out¬ 
come mapping is not unique. Other choices of coinci¬ 
dence assignment are possible and can be realized by 
additional four-mode unitary rotation on Bob’s end i.e. 

UBob ^ UBob ‘ Uswap- 

We emphasize that Ut, Ui,--- , C /4 and UBob define 
globally optimal unitary mode transformation. The re¬ 
sult of their action on the input state I'l'in) can be deter¬ 
mined in the following fashion. Let us denote input mode 
photon creation operators ds a\,k = - ■ ,4, then, 

l^m) = ^(44 + 44 + 44 + 44)|o): (lo) 

where |0) is a four-mode vacuum state. Consider first 
the unitary mode operation Ut defined earlier. It acts 

on the creation operators aj) as a linear transform i.e. 

4 

at —^ ^ {Ut)kjO,], where {Ut)kj are the matrix elements 
1=1 

of Ut- As a result, the input state jiFm) in Eq.(lO) is 
transformed to 

1 4 

1 ^*) = i^{Y.^{Ut)u{Ut)2j + {Ut)2^{Ut)^, + {Ut)u{Ut)A, 

//=i 

+ (t/t)3,(C/*)44]44)|0)- ( 11 ) 

Similarly, unitary mode transformations Ui, - - - ,Ui and 
UBob can now be applied to the state lift) in Eq.(ll) in 
sequence, resulting in the desired output state at Bob’s 
detectors. 

The structure of the globally optimal solution is most 
transparent when Ua = Ub = Uc = 1. In this case the 
physical setup that implements our communication pro¬ 
tocol is illustrated in Fig. (2). Surprisingly, its structure 
is equivalent to a double Mach-Zehnder interferometer. 
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The source prepares the state IT^) in Eq.(4) by plac¬ 
ing a separable state containing two single photons in 
modes 1 and 2 i.e. |1,1,0,0) onto two 50/50 beam split¬ 
ters coupling modes (1,3) and (2,4). To transform her 
two modes, Alice needs only 180° phase shifters. Note 
that the transformation U 3 applies a phase shift to both 
Alice’s modes with respect to Bob’s modes. Bob recom¬ 
bines modes (1,3) and (2,4) on two 50/50 beam split¬ 
ters and sends them for the coincidence detection. In 
the case of noiseless channel and perfect detectors this 
converts the states IV’i) —> |1,1,0,0), [^ 2 ) —>■ |0,1,1,0), 
IV’ 3 ) |0, 0,1,1), |■i/ 4 ) —)■ |1, 0,0,1) which results in the 

channel capacity (7 = 2 bits. Also note that photon num¬ 
ber resolving detectors are not required by Bob since he 
relies on a coincidence detection. 


B. Channel Capacity of Resource-Equivalent 
Entanglement-Pree Protocols 

Let us now calculate the channel capacity for the case 
when Alice and Bob do not share any entanglement in 
order to determine whether our entanglement-assisted 
protocol provides any advantage for information trans¬ 
mission purposes. For that we consider the following 
resource-equivalent scenario: Alice prepares a two-mode 
state IV'i),* = I,"' corresponding to a character 
from a M-letter alphabet X {M is to be determined 
later), with probability p{'4’i) and sends it to Bob who 
can perform an arbitrary two-mode unitary transforma¬ 
tion UBob 2 before he detects the received state using two 
photo detectors that are not number resolving (because 
the correspondent entanglement-based protocol does not 
require photon-number-resolving detectors). 

To facilitate a fair comparison between the 
entanglement-free and entanglement-assisted proto¬ 
cols, we assume that Alice’s resources per character 
are limited to at most two single photons and she may 
use only linear optical elements(i.e. only two-mode 
unitary operations) to prepare states {ipi). In this case, 
inputs to Alice’s state preparation setup are limited to 
|0,0), |0,1), |1,0), |I, 1). Linear optical operations that 
Alice may perform to prepare states lipi) conserve the 
total number of photons and, thus, \'ipi) cannot be a su¬ 
perposition of states with different total photon numbers 
such as, for instance, |0,0) -I- |I,1) or |0,0) -I- |0,1), etc. 
Also, the states \ipi) must be mutually orthogonal. If 
they are not. Bob will be forced to discriminate between 
non-orthogonal states since he can only apply a two¬ 
mode unitary transformation (unitary transformations 
do not change the overlap between the states \ipi)) to 
all states he receives. This will effectively reduce the 
amount of information that Bob and Alice can exchange 
and make such a communication protocol suboptimal. 

The subspaces that contain states with different to¬ 
tal number of photons are orthogonal and we only need 
to determine how many orthogonal states can be cre¬ 


ated by Alice in each subspace by means of a two-mode 
unitary transform. The subspace spanned by vacuum 
10 ,0) is invariant under two-mode unitary transforms 
and Alice can use |0,0) as \ipi). The one-photon sub¬ 
space spanned by |0,1) and |I,0) trivially contains two 
orthogonal states e.g. IV’2,3) = =*= |1)0)) that 

can be generated using a 50/50 beamsplitter. Finally, it 
is straightforward to show that only one state orthogo¬ 
nal to 11,1) can be generated by two-mode unitary op¬ 
erations in the two-photon subspace. Hence, Alice can 
choose \'tp 4 ) = ^(|2, 0) — |0, 2)) and \tp^) = |1,1). This 
implies that Alice may prepare up to M = 5 mutually 
orthogonal states by using linear optical elements. 

We now can calculate the channel capacity for the 
above choice of the states \tpi),i = 1,5, maximizing the 
mutual information function over Bob’s unitary transfor¬ 
mation UBob2 and probabilities pQ-ipi)), subjected to the 
average photon number constraint in Eq.(l) which for 
this case reads, 

P2+P3 + 2p4 + 2p5 = 1. (12) 

We found, by solving the optimization task numerically, 
that Bob’s optimal transformation is a 50/50 beam split¬ 
ter and the channel capacity in this case amounts to 2 
bits. The optimal values for pi’s are Pi = P2 = Ps = P5 = 
0.25; p 4 = 0. Therefore, Alice can equivalently (up to a 
two-mode unitary transformation) use the following set 
of states to encode her message: 

{|^,)} = {|0,0),|0,1),|1,0),|1,1)}. (13) 

This is because Bob’s detectors are not number resolving, 
and he cannot discriminate a single photon event from a 
two-photon event. Therefore, even if Alice can prepare 
five mutually orthogonal states the amount of informa¬ 
tion that Bob can extract from them is the same as if 
Alice used the four state alphabet in Eq.(13). 

At first sight one may conclude that if Bob has 
ideal(100% efficient) detectors and additional synchro¬ 
nization information to distinguish the vacuum signal 
10 ,0) from no signal events, he could discriminate the 
states I'lpi) in Eq.(13) perfectly which implies that the 
channel capacity of the entanglement-free protocol is also 
2 bits. However, if the vacuum detection is ruled out, it 
would further reduce the set to only three states 

i.e. {\'ipi)} = {|0,1), |1,0), |1,1)}. Moreover, by combin- 

5 

ing Eq.(12) with the normalization condition '^Pi = 1 

i—1 

and setting = p 4 = 0 we immediately derive that 
P 5 = 0. So effectively this becomes a two-mode two- 
statedlV^i)} = {|0,1), |1, 0)}) protocol with the channel 
capacity of 1 bit. Therefore, under the condition of no 
vacuum detection, our entanglement-based protocol al¬ 
lows one to communicate one extra bit of information 
when compared to the best possible entanglement-free 
two-mode communication scheme. 

Lastly, let us consider what happens when the vacuum 
detection is allowed but Bob’s detectors are non-ideal. 
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FIG. 3: (Color on-line) Channel capacity as a function of the 
mean number of photons communicated by Alice. 


We show in Appendix A that in this case the difference 
in the channel capacity of the proposed entanglement- 
based protocol and its resource-equivalent entanglement- 
free analogue discussed earlier in this subsection is a pos¬ 
itive function of the detection efficiency (see Fig. (5) for 
details). Thus, we show that for realistic detectors our 
entanglement-based protocol always allows transmission 
of more information then its entanglement-free version 
with the same amount of resources used. 


C. Channel Capacity as a Function of Average 
Photon Number 

Next, let us study how much information can be com¬ 
municated in our entanglement-assisted protocol by send¬ 
ing less than one photon on average. This can be readily 
done by modifying the optimization constraint in Eq.(l). 
Note that, since Alice only uses passive optical elements, 
the average number of photons she sends to Bob is actu¬ 
ally controlled by the source i.e. 

{^Alice} T | 

where aAi,A 2 denote photon annihilation operators for 
Alice’s modes 1,2. Thus, the new optimization problem 
at hand is, 

maximize I{ip;(l)) 

S.t. {N Alice) = n, (15) 

where n is a constant G [0,1]. We used a gradient-based 
solver to optimize Eq.(15) numerically for various values 
of the average photon number n. For a fixed value of 
n = 0.05 we ran 500 independent optimizations using 


random starting points. We then selected the largest 
value of max/('(/i; (()) over these runs and used it as a 
starting point for optimizing for the next value 

of n = 0.1. By gradually varying the constraint value 
we calculated max I{'ijj; (f) as a function of n depicted in 
Fig. (3). We observe that if Alice sends Bob less then one 
photon on average then their channel capacity falls below 
two bits. However, they can achieve channel capacity 
C « 1.63 bits (the best value demonstrated to date [5]) 
by communicating « 0.68 < 1 photons on average. 


D. Communicating Larger Alphabets 

In principle, Alice may want to use larger alphabets 
than just the four-symbol one. After all, we are operating 
with the states in and naturally the question arises 
whether a physical setup exists that attains the channel 
capacity C = log 2 M for some integer M G [5,10]. To an¬ 
swer this question we modified our protocol by allowing 
Alice to perform M > 4 unitary transformations on her 
two modes. At the same time we still require Bob to mea¬ 
sure in the Fock basis {\ 4 ‘j)},j = 1,10. We numerically 
optimized the channel capacity in Eq.(2) for the cases 
of M = 5, • • • ,10 and normalized the respective values 
to the maximal theoretically attainable channel capac¬ 
ity (C = log 2 M). The results are plotted in Fig.(4). 
We notice that the maximal theoretical channel capacity 
is only achievable for the case of the two-bit alphabet 
(M=4). When Alice is trying to use M > 4 symbols in 
her alphabet the normalized channel capacity decreases. 
This is because Bob is unable to deterministically dis¬ 
criminate between the states = I,-- - ,M\M > 4 

by using projective measurements in Fock basis. 


IV. OPTIMIZATION RESULTS FOR A = 6,8 
MODES n = 2 PHOTONS STATES 

In principle, the protocol described in Sec. II can be 
used for linear optical circuits with arbitrary number of 
modes N and photons n. Here, we study two simple 
extensions with two photons (n = 2) distributed over 
N = 6 and N = 8 modes. We assume that in both cases 
Alice and Bob receive N/2 modes from the source. Our 
goal is again to solve numerically the channel capacity 
problem posed in Eq.(2). 

For the n = 2, N = 6 case the dimensionality of the 
correspondent Hilbert space is dimC = =21 

which naturally leads to the question: is it possible to 
design an entanglement-assisted communication scheme 
that provides channel capacity of log 2 21 « 4.39 bits by 
sending just three modes from Alice to Bob? The neces¬ 
sary condition for that is Alice must be able to prepare 
21 orthogonal states from an input state j'!'™) by means 
of “local” three-mode unitary transformations. However, 
we discovered numerically that Alice can at best prepare 
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FIG. 4: (Color on-line) Normalized channel capacity. Red cir¬ 
cles denote the channel capacity of our entanglement-assisted 
protocol obtained by solving Eq.(2) as a function of the num¬ 
ber of Alice’s local operations M normalized to the theoretical 
maximum of log 2 M. 


V. SUMMARY 

We discussed the problem of entanglement-assisted 
communication channel capacity gain for a linear opti¬ 
cal circuit with N modes populated by n photons. We 
discovered in the case of fV = 4, n = 2 there is a 
class of mode-entangled states that supports protocols 
with a range of channel capacity gain over correspond¬ 
ing resource-equivalent entanglement-free protocols. We 
studied numerically 6 and 8 mode extensions of the pro¬ 
tocol and provided estimates for the channel capacity in 
those cases. 
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12 orthogonal states using three-mode unitary transfor¬ 
mations. This means that the channel capacity cannot 
possibly exceed log 2 12 « 3.58 bits. Next, we numeri¬ 
cally solved the optimization task in a modified version 
of Eq.(2) for the case of M = 12 local U{3) operations 
performed by Alice and one global C/( 6 ) mode transfor¬ 
mation performed by Bob. Obtained solutions suggest 
C = 3.0 bits which implies that even if Alice can lo¬ 
cally prepare 12 orthogonal states. Bob cannot discrim¬ 
inate then deterministically by means of linear optics 
and coincidence detection. Indeed, we discovered that 
Bob can only perform a non-ambiguous detection of 8 
orthogonal states encoded by Alice. Therefore, the prac- 
tical(achievable) channel capacity in the case of n = 2 
photons in = 6 modes is limited to 3 bits. This result 
was obtained using an unconstrained optimization of the 
mutual information function. The actual mean number 
of photons per character in this scheme is 1.5 photons. 
Correspondent entanglement-free three-mode communi¬ 
cation schemes with the same photon-per-character cost 
are equivalent to a six state protocol where Alice sends 
states {| 1 , 0 , 0 ), | 0 , 1 , 0 ), | 0 , 0 , 1 ), | 1 , 1 , 0 )| 1 , 0 , 1 ), | 0 , 1 , 1 )} 
which results in the channel capacity of log 2 6 « 

2.585 61ts(assuming perfect detectors and no vacuum de¬ 
tection constraint). Therefore, the information gain in 
the n = 2 photon, N = 6 modes dense coding scheme is 
« 0.415 < 1 bits. 

Similar analysis for the case of n = 2 photons in N = 
8 modes revealed that the practical channel capacity of 
the eight-mode communication is limited to log 2 12 « 
3.58 bits (compare to the four-mode entanglement-free 
channel capacity of log 2 10 « 2.3 bits). 


Appendix A: Channel Capacity with Imperfect 
Detectors 


First, let us discuss why the detection of vacuum with 
a photon-counting detector is less efficient than the de¬ 
tection of a photon with the same detector. We assume 
that for any input state of light the detector produces 
either a “click” outcome (denoted as -I-) or a “no click” 
outcome (denoted as —). Next, we denotep(—|0) = v the 
conditional probability of the detector producing a “no 
click” outcome, given the input state on the detector was 
vacuum |0). We also introduce the conditional probabil¬ 
ity p(-|-|l) = s of the detector clicking provided the input 
state was a single photon |1). In a similar fashion we 
define the conditional probability p(-|-| 0 ) = 1 — n for the 
detector to click erroneously when the input state was | 0 ) 
and the probability p(—|l) = 1 —s for the “no click” event 
when the input contained a photon. Note that for im¬ 
perfect detectors “no click” (“click”) does not necessary 
imply that the input state was zero(one) photon. 

Assuming that states |0) and |1) have equal probabil¬ 
ities of arriving at the detector i.e. p( 0 ) = p(l) = | let 
us calculate the probability p( 0 |—) of the vacuum state 
arriving at the detector provided a ”no click” event was 
recorded. Using Bayes’ rule we obtain. 


p(o|-) 


p(-|0) ■p(O) 

PNC 


V 

1 — s + u ’ 


(Al) 


where the total probability of a “no click” event patc is 
given by 


PNC =P(-| 0 ) •p(O) -kp(-|l) •p(l) 


1 - 


(A2) 


2 







Similarly, we can compute the probability p(l|+) of a 
single photon arriving at the detector provided a ’’click” 
event was recorded as 


p(C|| 0 ))-p(| 0 )) 

p 1 +) = ----- = rd-> 

PC 1 + s — u 


(A3) 


where the total probability of a ” click” event pc is given 
by 


PC =p(+|0) •p(O) +p(+|l) •p(l) = (A4) 

Comparing Eq.(Al) and Eq.(A3) for a fixed value of v 
{v « 1 ) we observe that p(l|+) > p( 0 |—) for any finite 
efficiency detector (s < 1). For a typical state-of-the-art 
superconducting single photon detector [11] s < 0.9 and 
1 — u « 10 “^ which translates into the vacuum detection 
efficiency of p(0|—) « 0.91. On the other, hand the ef¬ 
ficiency of a single photon detection is p(l|-|-) = 0.9999. 
Therefore, detecting a single photon in a single mode is 
almost 10% more efficient than detecting vacuum. Fur¬ 
thermore, for the two-mode vacuum state | 0 , 0 ) the detec¬ 
tion efficiency is « 0.82 which is significantly less than 
the detection efficiency for states |1,1) (« 0.9998) and 
|0,1) (« 0.91). Such a disparity in detection efficiency 
suggests that linear optical schemes that rely on double 
vacuum detection will experience a more severe channel 
capacity reduction than coincidence-based schemes. 

With the previous discussion in mind, let us now 
quantify the effect of imperfect detectors onto the 
achievable channel capacity for both entanglement- 
free and entanglement-assisted communication proto¬ 
cols proposed in Section 111. In the entanglement-free 
case. Bob uses two imperfect photon detectors to re¬ 
solve between four possible states in Alice’s alphabet 
X = {|00), |01), |10), |11)}. Therefore, Bob’s alpha¬ 
bet contains four possible detection outcomes y = 

{ -, —h, -I—, -I—I-}. Assuming that Bob’s detectors have 

the same efficiency s < 1 , u « 1 we can calculate the mu¬ 
tual information function I{X\y) between Alice and Bob 
as, 


4 4 

I{X;y) = EE p{xj,yk) log 

j = l k=l 


p{xj,yk) 

v{xj)p{yk)' 


(A5) 


where the probability p{xj,yk) can be expressed in the 
matrix form. 


v^Pi u(l — v)pi 

u(l — s)p2 vsp2 

v{l-s)p3 {l-v){l-s)p3 
(1 - s)^P 4 s(l - s)p4 


u(l - v)pi 
(1 - s)(l - v)p 2 
svps 

S(1 - S)p4 


(1 - v)‘^pi 
S(1 - V)p2 
S(1 - V)P3 
S^P4 


(A 6 ) 

with the entry in the j-th row and fc-th column rep¬ 
resenting a joint probability of Alice sending a state Xj 
from X and Bob detecting an outcome yk from y. Here 
Pi,i = 1,4 is the probability of Alice sending the Ath 
state from her alphabet i.e. pi = p{xi)\li. An explicit 



FIG. 5: (Color on-line) Channel capacity difference between 
entanglement-assisted and entanglement-free communication 
protocols as a function of the photon detection probability s. 


expression for the mutual information can be readily ob¬ 
tained by combining Eqs.(A5-A6). For fixed values of s 
and V, I{X-,y) is a function of only two independent pa¬ 
rameters e.g. Pi and p 2 {p 3 and p 4 can be expressed in 
terms of pi and p 2 using Eq.(12) and probability normal¬ 
ization) and maximizing it numerically gives the channel 
capacity Cnoent for the entanglement-free communica¬ 
tion scheme. In a similar fashion we can compute the 
channel capacity for the entanglement-assisted protocol. 
In this case Alice prepares one of the four states from 
the set X = {|V'i))” • ) |l/’ 4 )} defined in Section If and 
Bob registers one of 16 possible four-mode click patterns 
y = { -, H-, ■ • • , + + ++}■ The mutual in¬ 

formation I{X\y) can now be calculated by extending 
the summation range over k from 4 to 16 in Eq.(A5) 
and introducing a correspondent 4 x 16 join probability 
matrix p(xj,yk)- Note that unlike in the entanglement- 
free protocol now I{X\y) depends on three parameters 
Pii‘ ‘ ‘ tP 3 for a fixed value of s and v. The final algebraic 
expression can be optimized numerically to find the chan¬ 
nel capacity of the entanglement-assisted protocol Cent- 
We have obtained values of AC = Cent — Cnoent nu¬ 
merically by using a random search method on a set of 
10 ^ randomly generated probability distributions {pi} for 
various values of the single photon detection probability 
s and V = 0.9999. The results are displayed on Fig.(5). 
We noticed that in the limit of perfect detectors both 
protocols reach the same channel capacity (2 bits). How¬ 
ever, with realistic detectors (s < 1) the entanglement- 
assisted protocol exhibits an information gain over its 
entanglement-free counterpart. In particular, for the best 
superconducting detectors (s « 0.9) the information gain 
is « 0.27 bits. Although, the gain is < 1 bit, it is greater 
than the actual experimental gain of 0.13 bits observed 
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